Detecting Domain Dedicated Polar Words

Authors

  • Raksha Sharma
  • Pushpak Bhattacharyya
Abstract

Many words change polarity from one domain to another. For example, unpredictable is positive in the movie domain but negative in the product domain. Such words cannot be entered in a “universal sentiment lexicon”, which is supposed to be a repository of words whose polarity is invariant across domains. Instead, separate domain-specific sentiment lexicons must be maintained. The main contribution of this paper is an effective method for generating a domain-specific sentiment lexicon. For a word whose domain-specific polarity needs to be determined, the approach uses the Chi-Square test to check whether the difference between the word's counts in positive and negative polarity documents is significant. We extract 274 words that are polar in the movie domain but are not present in the universal sentiment lexicon. Our overall accuracy in detecting movie-domain-specific polar words is around 60%.
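
The Chi-Square step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes equally sized positive and negative corpora, a null hypothesis that the word is equally likely in both (so the expected count in each class is the mean of the two observed counts), and a 0.05 significance level with one degree of freedom (critical value 3.841). The function name chi_square_polarity and the example counts are hypothetical.

```python
# Critical value of the chi-square distribution for 1 degree of freedom
# at significance level 0.05 (assumed threshold for this sketch).
CRITICAL_VALUE_DF1_P05 = 3.841


def chi_square_polarity(pos_count, neg_count, critical=CRITICAL_VALUE_DF1_P05):
    """Return (statistic, label) for a word's counts in positive and
    negative documents.

    Assumes both corpora are the same size, so under the null hypothesis
    the expected count in each class is the mean of the observed counts.
    """
    total = pos_count + neg_count
    if total == 0:
        # The word never occurs, so there is no evidence either way.
        return 0.0, "neutral"

    expected = total / 2.0
    statistic = (
        (pos_count - expected) ** 2 + (neg_count - expected) ** 2
    ) / expected

    if statistic < critical:
        # The difference between the counts is not significant.
        return statistic, "neutral"
    return statistic, "positive" if pos_count > neg_count else "negative"


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    for word, pos, neg in [("unpredictable", 80, 20), ("plot", 52, 48)]:
        stat, label = chi_square_polarity(pos, neg)
        print(f"{word}: chi2={stat:.2f} -> {label}")
```

With unequal corpus sizes, the expected counts would need to be scaled by each corpus's share of the total, which this sketch does not do.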

Similar resources

Domain Sentiment Matters: A Two Stage Sentiment Analyzer

There are words that change their polarity from domain to domain. For example, the word deadly is of positive polarity in the cricket domain, as in “Shane Warne is a ‘deadly’ leg spinner”. However, ‘I witnessed a deadly accident’ carries negative polarity, and going by the sentiment in the cricket domain will be misleading. In addition to this, there exist domain-specific words, which have the same pola...

Finding Domain Specific Polar Words for Sentiment Classification

This paper presents a method of using conditional random fields (CRF) for extracting polar words and determining the overall sentiment of text. We frame sentiment classification as a feature selection problem and conduct three sets of experiments by using: prior polarity lexicons, bag-of-words classifiers and CRF sequence models. The results show the potential of utilizing CRFs in discovering h...

Pothole Detection by Soft Computing

Subject: Potholes on roads are regarded as a serious problem in the transportation domain, and ignoring them leads to more accidents, traffic, vehicle fuel consumption, and wasted time and energy. As a result, pothole detection has attracted researchers’ attention, and various methods have been proposed for it. Background: Most previous research is based on i...

Value of Dedicated Head and Neck 18F-FDG PET/CT Protocol in Detecting Recurrent and Metastatic Lesions in Post-surgical Differentiated Thyroid Carcinoma Patients with High Serum Thyroglobulin Level and Negative 131I Whole-body Scan

Objective(s): In clinical practice, approximately 10-25% of post-surgical differentiated thyroid carcinoma (DTC) patients with high serum thyroglobulin (Tg) and negative 131I whole-body scan (WBS) have poor prognosis due to recurrent or metastatic lesions after radioactive iodine treatment. The purpose of this study was to evaluate the value of 18F-FDG PET/CT scan in DTC patients with high seru...

Domain Adaptation for Opinion Mining: A Study of Multipolarity Words

Expression of opinion depends on the domain. For instance, some words, called here multi-polarity words, have different polarities across domains. Therefore, a classifier trained on one domain and tested on another will not perform well without adaptation. This article presents a study of the influence of these multi-polarity words on domain adaptation for automatic opinion classification. W...

Publication date: 2013